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METHOD FOR FORECASTING CONSUMPTION AND GENERATING OPTIMAL 
DELIVERY SCHEDULES FOR VEHICLES INVOLVED IN DELIVERING PROPANE 
AND OTHER CONSUMABLES TO END CONSUMERS 



CROSS REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims benefit of United States Provisional Application Number 

60/418,780 filed October 15, 2002. 

FIELD OF THE INVENTION 

[0002] The invention concerns an improved forecasting and routing system for 

scheduling propane and other consumable deliveries to end consumers by one or more capacity 
limited trucks in a multi-stop environment. 

BACKGROUND OF THE INVENTION 

[0003] Techniques are known for optimizing vehicle routing to replenish inventories in 

the case of the delivery of consumable products or periodic services using trucks in a multi-stop 
environment. Industry practices in these areas are set forth below. 

Inventory Replenishment-Delivery Economics: 

[0004] In general, consumption parameters provided by time series forecasting modeling 

modules calculate expected inventory levels for every day since the last delivery. The expected 
run out date will occur when the cumulative expected consumption has exhausted the last 
recorded inventory, which is usually captured at the last delivery. If there were no cost penalty 
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to exhausting inventories, one would wait for that event before triggering replenishment. That 
would minimize the number of deliveries. 

[0005] However, since substantial penalties are incurred for exhausting inventories (e.g. 

running out of propane gas), a classic inventory calculation trading off increased lot 
size/(decreased annual unit shipping costs) and expected stock out costs/(out of gas penalties) is 
needed to determine the optimal delivery day (ODD) which therefore occurs sometime before 
the expected run out day. The smaller the variance (error in predicting consumption) the closer 
the ODD is to the expected run out day and the steeper the trade off curve. The techniques used 
in this calculation have long been utilized for inventory reordering in a variety of industrial 
settings. Typically, the ODD will further be advanced by an additional number of days of 
reserve, which is the lead time within which a delivery is reasonably assured (e.g. allowing time 
for a second delivery if the first one failed for any reason). To handle special cases of deliveries, 
the system will also allow customer specified delivery days that override any statistically 
determined optimal delivery day. 

Optimal Vehicle Routing: 

[0006] Deliveries, however, cannot be considered independently because they must be 

combined with other deliveries on a route. To achieve this, commercially available optimal 
vehicle routing (OVR) packages, which have been readily available for many years, can be used. 
The basic purpose of OVR packages is to minimize the mileage and travel time costs associated 
with delivering a number of predetermined loads subject to a vehicle's carrying capacity (e.g. 
volume, weight) and allowable time on the road. OVR package functionality details often vary. 
However, all commercial OVR systems assume that deliveries must be made according to a 
schedule predetermined in the input. So a delivery required for a given day will be assigned to 
routes specified for a given day or a delivery which is free to be scheduled on several days will 
be free to be assigned to routes on those days. In other words, OVR systems consider only the 
geographic efficiencies of the routes generated to guide the assignment and cannot deal with 
additional complications inherent in delivery of consumable commodities, such as: 

(1) OVR packages cannot account for the variation in delivery quantity that would occur if the 
delivery day were to change, i.e. delivering a time consumable commodity such as propane or 
fuel oil earlier would result in a smaller load and therefore allow more vehicle capacity for other 
deliveries and vice-versa. 

(2) OVR packages cannot account for the day dependent expected cost of an individual delivery 
that are derived from inventory replenishment cost trade-offs. 
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[0007] There has been some experimentation in the propane industry with the use of 

OVR packages but because of the two complications identified above, these exercises have 
assumed very conservative service policies on delivery day selection (to avoid out of gas 
conditions). Over time these conservative policies result in the generation of significantly more 
customer deliveries than would be generated if both temporal (delivery day selection) and 
geographic (delivery sequencing) dimensions were optimized concurrently. Such optimization is 
implemented in accordance with the invention. 

SUMMARY OF THE INVENTION 

[0008] The invention is an improved scheduling and routing system for deliveries of 

propane and other consumables and periodic services via trucks with finite capacity that provides 
the timing, sequence, quantities, choice of replenishing locations and vehicle selection for 
deliveries. In accordance with a first feature of the invention, software estimates the 
consumption of customers with limited delivery history by using their individual characteristics 
(e.g., tank size) to match them with a pool of similar customers whose consumption can be 
statistically determined from their delivery history. In accordance with a second feature of the 
invention, software combines the specific over the road routing economics with the delivery 
economics for delivering propane to maximize the rate of delivery in gallons per hour. These 
two features in combination with existing industry practices in inventory replenishment and 
optimal vehicle routing form the components of the invention. 

Forecasting Consumable Consumption 

[0009] The invention is designed for use in cooperation with a computer having memory 

and is assumed to interface with external administrative information systems that capture 
necessary input data for analysis. Among the critical items captured are (1) customer 
information including delivery location, storage tank size for the consumable, consumption type 
(e.g., for propane, consumption types include: heating, cooking, swimming pools, and 
combinations of the above) and (2) customer consumable delivery history (dates & quantities). 
For energy based consumables such as propane or heating oil, also captured is the weather 
history (temperatures or degree-days) and weather forecasts in terms of degree-days, a 
commonly used metric in the propane and fuel oil industries. The system uses the above inputs 
to model consumption of the consumable at the customer level. In this model, both the expected 
consumption and its variance must be derived to optimally schedule deliveries. 

[0010] For propane, relatively short time series (0 to 20 deliveries) characterize 

consumption at the customer level; therefore, exogenous information (e.g., type of usage, tank 
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size, size of residence, periods when no consumption is known to occur) is used to guide the 
statistical analysis. The exogenous information becomes progressively less important with the 
length of the time series and consistency of consumption of the consumable. The system 
recognizes that the reliability of exogenous information is not certain. 

[0011] For example, statistical analysis of 20 deliveries (observations) may derive better 

forecasts without the use of exogenous information. However, with less than ten deliveries, one 
almost always needs exogenous information to reduce the degrees of freedom that must be 
statistically resolved. In cases of less than 3 to 5 deliveries, forecasting is totally dependent on 
the use of exogenous characteristics to link a given customer to a cluster of other customers for 
whom consumption parameters have been reliably calculated. These clusters would typically 
include customers attached to the geographically local business unit but may also be derived 
from customers external to that unit, particularly if the local unit is just initiating operations. 

[0012] At the conclusion of the forecasting phase each customer has one to three 

consumption parameters (expected time dependency, degree day dependency, seasonal 
dependency) necessary to forecast its mean consumption and a variance term. The invention 
thus provides a seamlessly integrated multiple-technique forecasting approach to short time 
series analysis. 

Risk Managed Routing for Consumable (Integration of Routing & Forecasting): 

[0013] The principal outputs of the cost trade off calculation derived from the forecasting 

and inventory replenishment optimization is the ODD, and a table of expected delivery quantities 
by day, as well as the expected cost penalties that would be incurred by advancing or postponing 
deliveries from the ODD. The invention extends the cost optimization for individual deliveries 
for consumable replenishment with the road routing economics of making several deliveries in 
sequence. The system accomplishes this by tightly integrating OVR algorithms for optimal route 
generation with the algorithms for optimal delivery day (ODD) selection, to allow a global 
optimization with cost trade offs between out of consumable penalties, the costs of multiple trips 
to deliver the consumable over the long term for an individual customer, and the daily costs of 
time and miles on the road for executing a route to deliver the consumable to a number of 
different customers. Therefore, in this invention, the total cost function, which is minimized, 
includes the routing costs typically considered by OVR algorithms plus the marginal cost 
penalties of shifting a delivery away from its ODD. 

[0014] Specifically, the days on which deliveries can be made are variable (with the 

exception of those which are a priori restricted) thereby greatly increasing the number of 
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possible route assignments for a given delivery. This combinatorial relaxation leads to 
substantially lower cost than the alternative approach, which first applies ODD calculations to 
generate a delivery schedule that remains fixed during the subsequent OVR algorithm 
optimization. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] A method for forecasting consumption and generating optimal delivery schedules 

for vehicles involved in delivering propane and other consumable deliverables and periodic 
services to end consumers in accordance with the invention is further described below with 
reference to the accompanying drawings, in which: 

[0016] Figure 1 illustrates the overall system flow of software implementing the 

invention. 

[0017] Figures 2a-2c illustrate the logic of the selection step with the associated pseudo- 

code. 

[0018] Figures 3a-3c illustrate the logic of the RAD calculating step with the associated 

pseudo-code. 

[0019] Figures 4a-4c illustrate the logic of the routing step with the associated pseudo- 

code. 

[0020] Figures 5a-5c illustrate the logic of the audit performance step with the associated 

pseudo-code. 

[0021] Figure 6 illustrates the overall process of the modeling step in accordance with the 

invention. 

[0022] Figures 7a-7c illustrate the logic of the clustering step with the associated pseudo- 

code. 

[0023] Figures 8a-8d illustrate the logic of the consumption-matching step with the 

associated pseudo-code. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

[0024] A particular embodiment of the invention is described below in detail and is 

intended to incorporate all alternatives, modifications and equivalents as to implementation on 
particular choices of computing hardware, software, or network platforms. The pseudo-code 
shown below can be implemented with a variety of appropriately functional procedural 
programming languages including C++, Visual Basic or Java, combined with any of a number of 
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statistical libraries such as SAS, Wolfram Research and NAG. The choice of operating systems 
and database management systems is open as well. The algorithms can be deployed on single or 
networked workstations, on distributed or centralized processors. Such variations are intended to 
be covered by the claims defining the invention. 

[0025] Of particular interest is the utilization of commercially available OVR packages 

for user interaction and display purposes. There are number of these packages commercially 
available. Furthermore, the OVR optimization algorithms can be employed to greatly simplify 
the optimization coding required by the system as a whole. The nature of this adjustment will 
also be described below. 

[0026] A presently preferred implementation of the system of the invention utilizes the 

Windows operating system operating on a Pentium 4 or greater chip. It utilizes C++, Visual 
Basic for Applications (VBA) and the Microsoft ACCESS database manager. It also utilizes 
SAS for statistical analysis. Finally, it preferably interfaces to the OVR package offered by 
ESRI called Arc Logistics Route (ESRI™). Those skilled in the art will appreciate that other 
hardware and software configurations available on the market also may be used. 

Introduction 

[0027] Those skilled in the art will appreciate that the "consumable" delivery described 

herein may include propane fuel deliveries, fuel oil deliveries, automatic teller machine (ATM) 
replenishment, field equipment preventive maintenance and other industrial deliveries of 
products that require periodic replenishment or services that require periodic scheduling. The 
propane delivery application is described in detail in an exemplary embodiment below as it is 
one of the more complex applications. The approach enumerated for propane deliveries may be 
simplified for other embodiments of the invention for other consumable delivery or periodic 
service delivery applications. 

[0028] The implementation of the invention includes two phases, one phase with three 

steps, and the other phase with two, which are run cyclically on a daily basis. As shown in 
Figure 1, the first phase is a planning phase 100. It occurs prior to the dispatching of a fleet for 
deliveries (execute routes step 200), typically in the early morning. It includes a selection step 
1 10, a risk evaluation step 120, and a routing step 130. The second phase is an assessment phase 
300. It occurs after the return of the fleet from their delivery runs, typically in the evening. It 
includes an operational audit step 310 and a forecasting step 320. 
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[0029] Figure 1 shows the daily workflow among these steps and the following pages 

describe the processes involved in each step with reference to the more detailed diagrams shown 
in Figure 2 through Figure 8. Additional reference is made to pseudo-code, which provides the 
inputs, outputs and logic for each step. 

Planning phase 

1. Selection: 

[0030] The logic of the selection step 110 is shown in Figure 2a with the associated 

pseudo-code in Figures 2b and 2c. 

[0031] In this step, three contiguous and successive time horizons are defined: 

• First, a routing horizon over which customers are routed; 

• Second, a safety (lead time) horizon which specifies an arbitrary safety level that 
deliveries must respect (i.e., if customers are delivered with 14 days of safety the average 
inventory at delivery time will be 10.5 days before run out); and 

• Third, a long-term horizon needed to take advantage of available truck capacity. 

[0032] At step 400, the degree-day forecast, last delivery specification, and forecasting 

parameters are read. The routing, safety and long-term horizon are also read in that step and used 
to calculate the overall planning horizon. The overall planning horizon is defined as the sum of 
all three horizons. Given this planning horizon, the customer selection process begins at step 402 
by evaluating a cost trade-off function specific for each customer using the forecasting 
parameters calculated in the modeling step. The cost trade-off function reflects the trade-off 
between the cost of delivery and the cost associated with the risk of out-of-gas (OG) on any 
given day. Since this function over time is convex it exhibits a single minimum that is used as 
the criterion for selecting the day for a customer delivery. 

[0033] Specifically, the availability of this cost function allows calculation, after 

initialization at step 404, of the total expected cost of delivery to each customer on every day of 

the planning horizon (step 406). This total expected cost is the sum of the delivery cost (which 

decreases with time because the delivery quantity will increase with time) and the expected 

stock-out cost (which increases with time as the inventory gets closer to exhaustion). A running 

minimum is determined at step 408 until the last day is reached (step 410). The day on which 

this total expected cost is minimal is the ODD (optimal delivery day) (step 412). This process is 

repeated for each customer (steps 414 and 416) until the cost has been calculated for all 

customers. As a rule, every customer for whom the cost trade-off function reaches its minimum 
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(ODD) within the planning horizon is selected for routing. In addition, all customers whose 
ODD fall within the long-term horizon should be assigned a lower priority than the one assigned 
to the customers whose ODD fall within the routing and the safety horizon. 

[0034] The specifics of the selection depend on the level of geographical and operational 

constraints imposed to the routing, and on the service policy. 

a. The constrains are defined by the combination of geographical and temporal limitation 
imposed on the routing. Four scenarios are possible: 

(1) A geographical area may be assigned to a particular vehicle active on a particular day 
of the week. 

(2) A geographical area may be assigned to any vehicle active on a particular day of the 
week. 

(3) A geographical area may be assigned to a particular vehicle active on any day of the 
week. 

(4) A geographical area may be assigned to any vehicle active on any day of the week. 

b. The service policy dictates that a customer should be afforded two chances at being delivered 
to avoid contingencies due to bad weather, and vehicle or operator availability. This service 
policy dictates the length of the safety horizon. Therefore, in scenario 1 and 2, which operates on 
a weekly cycle, this policy constrains the selection of customers to be triggered within a 14-day 
safety horizon. For scenario 3 or 4, since customers can be visited any day of the week, this 
policy constrains the selection of customers to be triggered within a 2-day safety horizon. 

[0035] Note that the selection of customers can follow other logic such as the imposition 

of a forced cycle or the reaching of the constant storage level (e.g., 20%). However, the 
proposed method, which accounts for both a consumption rate dependent on time and degree- 
days and the variability of that rate, provides the most accurate methodology for the selection 
process. 

2. Risk evaluation: Calculate RAD 

[0036] Once the cost minimum is determined in the selection procedure of Figure 2a, the 

range-of-allowable-days (RAD) calculation is performed (step 418). The logic of this step is 
shown on Figure 3a with the associated pseudo-code in Figures 3b and 3c. 

[0037] As shown in Figure 3a, the safety horizon, planning horizon and routing horizon 

are read at step 500. To place an upper bound on the risk that customers might incur in the 
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timing of their deliveries, a customer specific RAD for delivery is defined and calculated at step 
502. The RAD is used by the routing algorithm to restrict the delivery to these days. 

[0038] The procedure calculates the RAD for the set of customers identified in the 

selection process discussed above with respect to Figure 2a. Initially, the RAD is defined as the 
set of contiguous days within which the cost trade-off function values fall within a given 
tolerance of the optimal (minimum) value. This range of values is called the cost-tolerance-span. 
Because the cost trade-off function depends on many factors (i.e. Cost of delivery, cost of OG, 
yearly demand, storage size, consumption rate and consumption variance) these ranges-of- 
allowable-days will be different for every customer. 
[0039] Then two adjustments are required to this initial RAD: 

a. The RAD must be shifted in time by the amount of safety horizon and modified as follows: 

• If the RAD's minimum falls earlier than the first day of the routing horizon the RAD's 
minimum is replaced by the first day of the routing horizon; 

• If the RAD's maximum falls earlier than the first day of the routing horizon the RAD's 
maximum is replaced by the first day of the routing horizon; 

• If the RAD's minimum falls within the routing horizon and the RAD's maximum falls 
later than the last day of the routing horizon the RAD's maximum is replaced by the last 
day of the routing horizon; 

• If the RAD's minimum falls later than the last day of the routing horizon, the delivery is 
not considered (postponed). { 

As shown in Figure 3a, the adjusted minimum RAD and adjusted maximum RAD are calculated 
at step 504 and adjusted at steps 506 and 508. In general, only those customers with a non- 
empty RAD are retained for routing (i.e. those for which the adjusted RAD's minimum lies 
inside the routing horizon). 

b. The RADs of customers that must be delivered on specific days of the week must be further 
restricted to only those days at step 510. This occurs systematically in cases 1 and 2 defined 
above or, more generally, when customers are "forced" to a specific delivery day(s). For 
instance, in most environments, every customer delivery will be restricted to weekdays (i.e. 
Monday through Friday). If the required day of the week is not available in the calculated 
RAD it override's the calculated RAD. 

[0040] The RAD value is returned at step 512. Note that other assignments methods 

could be used for the definition of RAD. The one proposed above, however, provides the best 
delivery efficiency with the most controlled risk. 
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3. Route 

[0041] The logic of the routing step is shown on Figure 4a with the associated pseudo- 

code in Figures 4b and 4c. 

[0042] To complete the routing procedure, the following input needs to be provided: 

1. A capacity adjustment - which estimates the capacity changes necessary to reflect day-to- 
day variation in the demand. 

2. A truck assignment - which assigns all defined RADs to trucks 

[0043] The truck availability and selected delivery information is read at step 600 and the 

first customer is selected at step 602. To communicate the RAD constraints to the routing 
algorithm, the procedure assigns a string of pre-determined symbols (e.g. "123" or "457" or 
"25") to each customer (the "routing string") at step 604. These strings indicate the days of the 
routing horizon on which a customer can be delivered. For instance the string "123" would 
means that a customer can only be delivered on the first, second or third day of the routing 
schedule. Similarly, under that scheme, the string "457" would mean that a customer could only 
be delivered on the fourth, fifth or seventh day of the routing schedule. Finally the string "25" 
would mean that a customer could only be delivered on the second and the fifth day of the 
schedule. These routing strings are filed at step 606. The process is repeated for all customers 
through steps 608 and 610. 

[0044] Once the "routing strings" are assigned to the orders and compiled at step 612, the 

average demand over the final RAD is assigned to each of the selected customers. This demand 
assignment however poses a problem since the expected demand on any day prior to the day for 
which the average actually occurs will be systematically smaller than the average. Conversely, 
the expected demand on any day after the day the average occurs will be systematically larger 
than the average. These systematic errors will translate in systematic under-capacity early in the 
routing schedule and systematic over-capacity at the end of the routing schedule. To mitigate 
this effect the daily capacity of the trucks needs to be adjusted at step 614. This adjustment is 
calculated for each day of the routing horizon as follows: 

a. For a given day, the total potential demand for that day is calculated using the average 
demand assignment. 

b. For the same day, the total potential demand for that day is estimated using the expected 
demand derived from the forecasting parameters. 
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c. The gallon capacity of the trucks assigned to service the schedule on that day is then adjusted 
by multiplying the truck capacities by the ratio these two demands (Total average/ Total 
expected) for that day. 

[0045] This process is performed for each truck starting at step 616. To complete 

preparation for routing, the routing strings defining the RADs at the customer level must be 
associated with the trucks assigned to the schedule at step 618. This assignment is done in such 
a way that each of the trucks defined as available for a given day is assigned all the string 
symbols that contains that day of operation in the schedule. For instance, in the example above, 
all trucks slated to operate on the second day of the schedule will be assigned the string "123" 
and "25" defined above but not the string "457". Similarly, all truck slated to operate on the fifth 
day of the schedule will be assigned the string "457" and "25" defined above but not the string 
"123". Alternatively, the string "123" will be assigned to all the trucks slated to operate on the 
first, second and third day of the schedule. Similarly the string "457" will be assigned to all the 
trucks slated to operate on the fourth, fifth and seventh day of the schedule and the string "25" 
will be assigned to all the trucks slated to operate on the second and fifth day of the schedule. 
[0046] At the end of this string matching process each truck will be associated with a set 

of routing strings and only those customer deliveries matching a string in that set may be 
assigned to that truck. This process is repeated for all trucks at steps 620, 622, and 624. 
[0047] The routing algorithm follows the logic of the chosen OVR at step 626, which 

accounts for the constraints specified in the customer demand and their RADs. The string 
matching technique that partially reflects those constraints would typically require additional 
programming logic to what is provided in an off-the-shelf OVR package. This invention will 
first be implemented using the ESRI Arc Logistics Route Package wherein the "specialties" 
feature facilitates the string matching process. 

[0048] Once the optimization is completed counter adjustments to the truck capacity 

need to be reversed for reporting purposes. 

Assessment Phase 

4. Audit performance 

[0049] The logic of the audit performance step is shown on Figure 5a with the associated 

pseudo-code in Figures 5b and 5c. 

[0050] This operational audit compares actual and predicted gallons delivered on the day 

when the actual delivery occurred. This provides information about the performance of the 
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forecasting model. It also identifies the customers whose recent delivery history is not consistent 
with the forecasting model. In most cases these exceptions reflect actual changes in customer 
behavior that can be recorded and appropriately dealt with. 

[0051] This step can also measure the risk position of customers at the time of delivery. 

This information may point to systematic delays in executing delivery schedules that may point 
to a lack of delivery capacity or user mishandling of the information provided by the routing 
system. Such information may be the basis for justification of capacity expansion or training. 

[0052] In particular, as shown in Figure 5a, the degree-days, executed routes, and 

forecasting parameters are read at step 700. Then, starting at step 702, for each customer the 
planned delivery size is calculated (step 704), and basic delivery statistics are extracted (step 
706), and the performance information is stored (step 708). This process is repeated at steps 710 
and 712 for each customer. 

[0053] This operational audit information as well as the descriptive statistics generated 

during the modeling step can be used to support many other management functions not, identified 
in this document because they fall outside the specific scope of this invention. 

5. Model 

[0054] This step adds the delivery data (e.g., gallons delivered and tank level for 

propane) on the day of its occurrence to the customer's delivery history. This information is 
used to adjust the consumption parameters of that customer. This step is typically completed 
every evening on the set of customers delivered during the day. 

[0055] As shown in Figure 6, this process reads in the degree-day history (step 800) and 

customer delivery history and customer attributes (step 802). The system then performs three 
processes: 

• a statistical estimation of consumption parameters for those customers who have a stable 
and extensive history (step 804), 

• a clustering of the population of those statistically estimated customers into groups 
distinguished by specific attributes (step 806), and 

• a consumption matching of customers, whose consumption characteristics cannot be 
statistically estimated, to clusters of statistically estimated customers with similar 
attributes (steps 808 and 810). 

The resulting forecasting parameters are written at step 812. 
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[0056] The overall process of modeling is shown on Figure 6 and the logic of relevant 

modeling steps and the associated pseudo-code are shown on Figures 7 and 8. Only steps 804- 
808 require textual elaboration and follow in that sequence below. 

Statistical estimation (step 804) 

[0057] The statistical estimation is conducted with standard time series techniques that 

may include Winter's model, Box- Jenkins techniques or multiple regression methods. The 
purpose of this estimation is to identify and calculate statistically significant parameters that can 
be used to evaluate future consumption and its variance. In the propane industry, for example, 
these techniques are appropriate for about 50% of the customer population. 

Clustering (step 806) 

[0058] The logic of the clustering step is shown in Figure 7a with the associated pseudo- 

code in Figures 7b and 7c. The objective of the clustering methodology is to stratify the 
population of customers with statistically estimated consumption parameters. This stratification 
defines a number of sub populations characterized by specific attributes (e.g. tank size, 
appliances), which are associated with an appropriate set of consumption parameters that 
represent the entire population. 

Step 1 : Customers whose consumption parameters have been statistically estimated (step 900) 
are grouped by consumption type (e.g., heat, non-heat and mixed) and clustered using a chain- 
map procedure (step 902). Specifically, each group is sorted in ascending order of tank size and 
the difference between a customer tank size and that of its predecessor in the sorted ranking 
order is calculated. This difference represents a metric for similarity in customer tank sizes. 
Boundaries between clusters of tank size are then established when this difference exceeds a pre- 
set threshold. 

Step 2: With this cluster definition, the distribution of consumption parameters of all the 
customer of each cluster is estimated (step 904), and a typical level of consumption for the 
cluster is defined as a high percentile (e.g. 65%) of that consumption rate distribution. The 
estimate is biased to be higher than the expected consumption in order to take a conservative 
position in regards to stock-outs. To determine the 65 th percentile the parameters for a cluster are 
first calculated. Individual cells are then defined by equally dividing these ranges. Starting from 
the lowest cell, which includes the lowest value of the parameters, the percentile is incremented 
by adding neighboring cells that are included when a parameter range is incremented by one 
step. For heat and time dependency, this increment only affects one parameter. For the mixed 
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case, the increments alternate between heat and time parameters. When the accumulated 
percentile reaches 65% the parameter values, which correspond to that level, are associated with 
the cluster under analysis. 

Step 3: Finally a "catch-all" cluster is defined for each consumption type. The consumption 
parameters specific to the clusters of a given consumption type are then defined as high a 
percentile (e.g. 65%) of the consumption parameters for the entire population of customers of 
that consumption type. At step 906, the average consumption for each consumption type is 
estimated. 

Step 4: In addition to clusters of consumption type and tank size, this process provides a list of 
customers sorted by consumption level and coefficient of variation for each consumption type. 
These lists are used to estimate variance in the consumption matching process (steps 1010- 
1014). The coefficient of variation for each consumption type is sorted at step 908. 

Consumption Matching (step 808): 

[0059] The logic of the consumption-matching step is shown on Figure 8a with the 

associated pseudo-code in Figures 8b-8d. 

[0060] The results from the clustering process are used in the consumption matching 

procedure that is designed to assign consumption parameters and variance to customers whose 
delivery data does not permit reliable statistical estimation. This is predicated on the notion that 
customers with similar consumption types and similar tank sizes tend to consume gas at 
comparable rates. 

[0061] Specifically, the following matching procedure is applied to each customer for 

whom the statistical estimation was not possible (heretofore referred to as "customer under 
analysis"): 

Step la: The consumption type and tank size of the customers under analysis (step 1000) are 
used to select the appropriate cluster to represent it. It is then assigned the set of consumption 
parameters that characterize this cluster. 

Step lb: If the tank size of the customer under analysis is determined at step 1002 to be missing 
or out of range it is assigned the characteristics of the "catch-all" cluster of their consumption 
type (step 1004); otherwise, the tank size is matched to the nearest cluster at step 1006. At this 
point, if appropriate, the seasonality characteristic of the cluster is also associated with the 
customer under analysis (step 1008). 
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Step 2: The variance of the customer under analysis is then estimated using steps 1010-1014: 

• The consumption parameters of the customer under analysis which were assigned in 
steps la or lb above are used to build a temporary cluster of approximately 40 
customers who have statistically reliable parameters. These customers must have the 
same consumption type as the customer under analysis and their consumption rate 
must fall within a predefined interval around the consumption rate of the customer 
under analysis. (Step 1010) 

• The distribution of the coefficient of variation of consumption in this temporary 
cluster is then evaluated (step 1012) and the estimation of a high percentile of this 
distribution provides an estimate for the calculation of the variance of the 
consumption of the customer under analysis (step 1014). Again in this step the high 
percentile reflects a conservative assumption regarding stock-out conditions. 

[0062] At the end of these modeling steps, every customer should have 

1. A means to forecast expected consumption in the future (i.e., forecasting parameters) 

2. An estimate of its daily variance. 

[0063] These terms, along with descriptive statistics are the only requirements to run the 

selection process (Figure 2a above). 

Additional embodiments for applications other than propane delivery 

[0064] Fuel Oil: Fuel oil is used exclusively for heating. This simplifies the statistical 

regression analysis required for forecasting consumption because non-heating usage parameters 
can be dropped from the regression. 

[0065] ATM cash replenishment: ATM replenishment forecasting generally has 

lengthy time series available for forecasting. Therefore, clustering techniques for short time 
series are generally not needed although they may still be useful for setting up schedules for new 
installations. Also, the physical dimensions of cash canisters are fixed so that the truck capacity 
needed to make a delivery will not vary with the timing of the delivery. However, since ATM 
replenishment often occurs concurrently with ATM maintenance the scheduling algorithms may 
be driven by two demand factors instead of one. 

[0066] Field Equipment Preventive Maintenance and Servicing: The consumable is 

time and not a delivered commodity. Therefore, physical truck capacities are not a factor in 
limiting routes although time on the route will still be constrained. Furthermore, the equipment 
failure probabilities over time may be specified by the equipment manufacturers and not be 
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derived statistically from failure history, so as in the ATM case the scheduling algorithms will be 
driven by two demand factors instead of one. 

[0067] Those skilled in the art will also appreciate that numerous other modifications to 

the invention and numerous other applications for "consumable" products are possible within the 
scope of the invention. Accordingly, the scope of the invention is not intended to be limited to 
the exemplary embodiments described above, but only by the appended claims. 
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